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Abstract. 

We have measured the 3.6 /an luminosity evolution of about 1000 galaxies in 32 clusters at 0.2 < z < 1.25, 
without any a priori assumption about luminosity evolution, i.e. in a logically rigorous way. We find that the 
luminosity of our galaxies evolves as an old and passively evolving population formed at high redshift without any 
need for additional redshift-dependent evolution. Models with a prolonged stellar mass growth are rejected by the 
data with high confidence. The data also reject models in which the age of the stars is the same at all redshifts. 
Similarly, the characteristic stellar mass evolves, in the last two thirds of the universe age, as expected for a stellar 
population formed at high redshift. Together with the old age of stellar populations derived from fundamental 
plane studies, our data seems to suggest that early-type cluster galaxies have been completely assembled at high 
redshift, and not only that their stars are old. The quality of the data allows us to derive the LF and mass 
evolution homogeneously over the whole redshift range, using a single estimator. The Schechter function describes 
the galaxy luminosity function well. The characteristic luminosity at z = 0.5 is is found to be 16.30 mag, with an 
uncertainty of 10 per cent. 



Key words. Galaxies: luminosity function, mass function 
clusters: general 



Galaxies: evolution — Galaxies: formation Galaxies: 



1. Introduction 

The luminosity function (LF) is the basic statistic used 
to understand galaxy properties, giving the relative fre- 
quency of galaxies of a given luminosity in a given vol- 
ume. Most additional parameters determined for samples 
of galaxies having more than a single value of luminosity 
are usually averages weighted by the LF (in addition to 
underlying selection effects). Furthermore, by comparing 
the LF at different redshifts or environments it is possible 
to infer how galaxy luminosity evolves. 

Since starlight at 3.6 /im very nearly follows the 
Rayleigh- Jeans limit of blackbody emission for T > 2000 
K, the colors of both early- and late-type stars are similar. 
There is virtually no dust extinction at this wavelength 
either, since any standard extinction law predicts only a 
few percent of the extinction of optical wavelengths. The 
3.6 /mi light therefore traces the stellar mass distribution 
free of dust obscuration effects (Pahre et al. 2004). Thus, 
a useful approach to understanding how galaxies form is 
to track their growing stellar mass, measured through the 
evolution of the 3.6 fim LF. 

Several previous studies addressed the luminosity evo- 
lution of galaxies in clusters in near- infrared bands, no- 
tably de Propris et al. (1999). They found that the Zf-band 



LF of 38 clusters up to z = 0.92 is consistent with the 
behavior of a simple, passive luminosity evolution model 
in which galaxies form all their stars at high redshift and 
thereafter passively evolve. However, and perhaps because 
a standard cosmology was not in place at the time of that 
work, the authors do not address which of the other pos- 
sible evolutionary scenarios are rejected by the data and 
at what confidence. Kodama & Bower (2003), Kodama et 
al. (2004) found results compatible with little evolution in 
stellar mass, but with large uncertainties. 

The de Propris et al. (1999) results were anticipated 
(and also later confirmed) by the analysis of sub-samples 
of cluster galaxies: early-type or red galaxies have proper- 
ties (mainly colour and location in the fundamental plane) 
consistent with a passive evolution model. However, red 
or early-type galaxies are biased subsamples of the whole 
galaxy population and are affected by the progenitor bias 
(van Dokkum & Franx 2001). Furthermore, clusters of 
galaxies are also know to host star forming galaxies (e.g. 
Butcher & Oemler 1985). The fraction of blue galaxies 
growing with redshift (see e.g. Butcher & Oemler 1985; 
Rakos & Shombert 1995; but see Andreon, Lobo & Iovino 
2004 for a different opinion) makes results based on the 
red/early- type population less representative of the whole 
population as redshift increases. LF studies do not suf- 
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fer from the progenitor bias, and they directly approach 
the more fundamental problem of studying the evolution 
of the whole sample of galaxies. Therefore, LF studies 
are preferable to measure the ensemble mass evolution. 
Furthermore, stellar populations may be old, as pointed 
out by several colour or fundamental plane studies, but at 
the same time galaxies may not have been completely as- 
sembled: since z = 1 and the present-day, galaxies might 
have grown in mass through mergers. This key issue can 
be tested by comparing the mass function of galaxies at 
different rcdshifts. 

The paper is organized as follow: Sec 2 presents the 
data; Sec 3 describes the sample of studied clusters. Sec 
4 summarizes the method used to derive the LF. The LF 
and the mass evolution determination are computed in Sec 
5. Sec 6 and 7 discuss and summarize the results. 

Throughout this paper we assume Am = 0.3, Oa = 0.7 
and H = 70 km s" 1 Mpc -1 . 

2. Data & data reduction 

2.1. IR data 

IR data were obtained with the IRAC (Fazio et al. 2004) 
on the Spitzer Space Telescope (Werner et al. 2004). A 9 
deg 2 SWIRE (Lonsdale et al. 2003) field was imaged. The 
exposure time is 4 x 30s. 

The standard pipeline pBCD (Post Basic Calibrated 
Data, ver. 10.5.0) products delivered by the Spitzer 
Science Center (SSC) were used in this paper. These data 
include flat-field corrections, dark subtraction, linearity 
and flux calibrations. Additional steps included pointing 
refinement, distortion correction and mosaicking. Cosmic 
rays were rejected during mosaicking by sigma-clipping. 
pBCD products do not merge together observation taken 
in different Astronomical Observations Requests (AORs). 
AORs are therefore mosaicked together using SWARP 
(Bertin, unpublished), making use of the weight maps. 
Sources are detected using SExtractor (Bertin & Arnout 
1996), making use of weight maps. 

Star/galaxy separation is performed by using the stel- 
larity index provided by SExtractor, which, being the pos- 
terior probability based on a neural network, outperforms 
cuts (linear discriminators) in object parameter space 
(Andreon et al. 2000), as expected (e.g. Bishop 1995). We 
conservatively keep a high posterior threshold (class sta r = 
0.95), rejecting "sure star" only (class s t a r > 0.95) in or- 
der not to reject galaxies (by unduly putting them in the 
star class) , leaving some residual stellar contamination in 
the sample. This contamination is later dealt with statis- 
tically. 

We checked the star/galaxy classification using a 
0.3 deg 2 region deeply observed at Cerro Tololo Inter- 
American Observatory (CTIO) by taking images under 
sub-arcsecond seeing conditions (see sect 2.2). We com- 
pared the classification derived from 3.6 micron images 
with those of one of our CTIO data observed with sub- 
arcsec resolution. Only less than about 2 per cent of the 



objects classified as stars (using class sta r > 0.95) using 
IRAC images are actually resolved at the CTIO resolu- 
tion, and are therefore mis-classified. This 2 per cent of 
stars corresponds to 0.1 per cent of all the sources detected 
at 3.6 /im in the same field (and brighter than 18 mag), 
i.e. a negligible minority overall. Therefore, our posterior 
probability threshold (class s t a r > 0.95) does not reject 
galaxies by unduly putting them in the star class at an 
appreciable level. The residual stellar contamination from 
our choice of the posterior threshold is subtracted statis- 
tically, as in the optical (e.g. Andreon & Cuillandrc 2002) 
or the near-infrared (e.g. Andreon 2001). 

Images are calibrated in the Vega system, using the 
IRAC zero points provided by the SSC (and, in particular 
by G.Wilson 1 . 

From the inspection of the galaxy count distribution, 
the completeness magnitude at [3.6] is ~ 18 mag. Objects 
brighter than 12.5 mag arc often saturated. Therefore, 
from now on, only the range 12.5 < [3.6] < 18 mag is 
considered. In sec. 5.3 we check how results are affected 
by a potential incompleteness at the faint end. 

The average density of sources in a circle of the point 
spread area is around 0.004. Therefore, it is very unlikely 
that crowding is an issue in average density regions, and 
also in 10 to 100 times overdense regions, such as cluster 
cores. Only one of our clusters required an accurate setting 
of the SExtractor deblending parameters to split the few 
blended sources. 

2.2. Optical imaging data 

In this paper we used CTIO wide-field imaging to control 
the quality of Spitzer star galaxy classification, as men- 
tioned above. We adopt here part of the same imaging 
data used in Andreon et al. (2004a). In brief, optical R- 
and z'-band (A c - 9000A) images were obtained at the 
CTIO 4m Blanco telescope during August 2000 with the 
Mosaic II camera. Mosaic II is a 8kx8k camera with a 
36 x 36 arcminutc field of view. Typical exposure times 
were 1200 seconds in R and 2 x 750 seconds in z' . Seeing 
in the final images was between 0.9 and 1.0 arcsecond 
Full- Width at Half-Maximum (FWHM). Data have been 
reduced in the standard way (see Andreon et al. 2004a for 
details). Typical completeness magnitudes are R = 24.5, 
z' = 23 mag (5a) in a 3 arcsec aperture. 

3. The cluster sample 

The cluster sample studied in this paper consists of 32 
colour-selected clusters, all spectroscopically confirmed. 
The clusters were detected as spatially localized galaxy 
overdensities of similar colour, as described in Andreon 
et al. (2003; 2004a, b). The detection method used takes 
advantage of the observation that most galaxies in clusters 
share similar colours, while background galaxies have a 
variety of colours, both because they are spread over a 

1 http:/ /spider. ipac.caltech.edu/staff/gillian/cal. html 
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Fig. 1. Richness for our clusters. The dotted line marks Fi §- 2 " Redshift distribution of the studied clusters, 
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larger redshift range and because the field population is 
more variable in colour than the cluster one, even at a 
fixed redshift. All clusters but one were detected using 
R — z' colour; one cluster was detected using the z' — K 
colour because of its larger redshift (sec 3.2 of Andreon et 
al. 2005 for details about the latter cluster). After colour 
detection, the cluster nature of the studied clusters was 
confirmed with spectroscopic observations. The clusters 
are individually presented and studied in Valtchanov et al. 
(2004), Pierre et al. (2005), Willis et al. (2005), Andreon 
et al. (2004a, 2005a,b). 

Altough studied clusters are colour-selected, 29 out 32 
of the clusters are also x-ray detected, leaving only 3 clus- 
ters (at z = 0.49, 0.61 and at z = 1.02) with too faint x-ray 
emission to be detected with 10 to 20 ks XMM images. The 
detected x-ray emission guarantees that the studied clus- 
ters have deep potential wells and independently confirms 
the cluster nature of the studied objects. 

The studied cluster sample is not a volume com- 
plete sample, nevertheless it densely samples the explored 
Universe volume, up to z ~ 1. Assuming the local (Ebeling 
et al. 1997) Lx luminosity function, we found that the 
expected number of clusters with x-ray luminosity Lx > 
10 43 erg/s up to z « 1 and with more than 50 counts 
on 10 ks XMM images is about 15 per deg 2 . All 32 stud- 
ied clusters are in a contiguous 2.8 deg 2 area of the sky, 
and therefore the cluster number density is about 1 1 clus- 
ters per deg 2 . The high number density of clusters should 
make the studied sample somewhat representative of typ- 
ical clusters, making any bias of the studied sample small. 

In order to characterize the cluster richness, for each 
cluster we compute the number of galaxies inside a radius 
of 357 kpc (corresponding to 500 kpc in the old cosmol- 
ogy and in the nearby universe studied by Bahcall 1977) 



brighter than to* + 1.7 (corresponding to Bahcall ~ m.3 + 2 
for an a = — 1 Schechter function). The background con- 
tribution has been removed (marginalized) using Bayesian 
techniques described in Appendix B of Andreon et al. 
(2005). Here we quote the mean of the posterior and its 
rms as a measure of the cluster richness and its error, 
respectively, for a uniform prior (but results are similar 
using a Jeffreys prior). Clusters of Abell (1958) richness 
or 1 in the present-day Universe have Nq of about 15 to 
20 (Bahcall 1977). Figure 1 shows that two thirds of our 
clusters have No < 20 galaxies. Most of the clusters with 
No > 20 galaxies have large error bars, which make them 
consistent with No — 20 galaxies at two sigma. Therefore, 
most of our clusters are at the bottom of the Abell richness 
scale and are not rich systems. 

The 32 clusters are distributed in the redshift ranges 
in the following way (see also Figure 2): 13, 8, 5, 6 clusters 
are in the range: 0.25 < z < 0.40, 0.40 < z < 0.65, 0.75 < 
z < 0.90 and 0.99 < z < 1.25, respectively. 

4. The method: LF determination 

We do not attempt here, as in some previous approaches, 
to infer the luminosity function from an optically selected 
sample, because the latter option assumes the absence of 
very dust obscured objects: if galaxies are dust obscured 
enough to go undetected in the optical, their contribu- 
tion to the IR LF is deemed to be zero, when instead it 
may be relevant. Some previous authors were forced to as- 
sume the absence of very dust obscured objects, because 
they need spectroscopic redshifts, mostly acquired in the 
optical window, or because they need multi-colour optical 
photometry to determine a photometric redshift. Here, in- 
stead, we use the approach usually adopted for clusters, 
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Fig. 3. Stellar mass evolution for several stellar mass 
growth models (curves) and as derived from our data 
(shaded area). The region allowed by the data is part of 
the shaded green area derived in sec 5.2. 

where knowledge of the individual galaxy redshift is not 
used. 

In order to compute the LF we adopt two different 
methods. 

- For display purposes only, the LF is computed as the 
difference between the (binned) counts in the cluster and 
control field direction, as usual (e.g. Zwicky 1957, Oemler 
1973). Error bars are computed as the square root of the 
variance of the minuend, because the contribution due to 
the uncertainty on the true value of background counts is 
negligible. A negative number of cluster galaxies may oc- 
cur in the presence of a low cluster signal and Poissonian 
fluctuations, which leads to the unphysical result of nega- 
tive numbers of cluster galaxies when data are binned. 

- Second, we fit the unbinned galaxy counts, without 
any use of binned data or errors computed in the previ- 
ous approach. We are faced with the classical statistical 
problem of determining two extended (integral> 1) den- 
sity probability function, one carrying the signal (the LF 
of cluster members) and the other being due to a back- 
ground (background galaxy counts, BKG) from the ob- 
servations of many individual events (the galaxies lumi- 
nosities), without knowledge of which event is the signal 
(which galaxy is a member) and which one is background. 
Here, we follow the rigourous method set forth in Andreon, 
Punzi & Grado (2004, APG hereafter), which is an exten- 
sion of the Sandage, Tammann & Yahil method (1979, 
STY) to the case where a background is present, and that 
adopts the extended likelihood instead of the conditional 
likelihood used by STY. The method does not remove the 
background from the data, but adds a component (the 
background) to the model. The method also provide the 
normalization (at the difference of STY), needed to rig- 
orously combine the LF of the individual clusters, prop- 



erly accounting for the uncertainty in the LF normaliza- 
tions. 68 per cent confidence intervals are derived using 
the Likelihood Ratio theorem (Wilks 1938, 1963) made 
known to the astronomical community by Avni (1976) and 
Press et al. (1986), among others. The method comes in 
two forms: in sec 5.1 we neglect astronomical and statisti- 
cal subtleties, proceeding in the analysis as most previous 
published papers, whereas in sec 5.2 we take a fully rigor- 
ous approach, and we describe its advantages with respect 
to the simpler application. 



As the background field we considered a central 4 deg 2 
region for simplicity, and we fit the background counts 
with an arbitrary function. In this region, there are about 
106000 objects. The average background is therefore very 
well determined, because it is measured over a large area 
with respect to the cluster area. Its determination is so 
good that galaxy counts in the cluster direction (an area 
about 750 times smaller) does not constrain the back- 
ground counts at all. Therefore, it is justified here to keep 
the background parameters at the best fitting values ob- 
served in the control field (but see APG for why this is 
unjustified in general). 

For all but 2 clusters, we measured the LF within a cir- 
cle of 5 arcmin aperture. For the remaining two clusters an 
aperture of 3 arcmin is taken because of incomplete data 
coverage or because of a bright nearby star. This aperture 
is similar to the one used in the optical LF determina- 
tion for several clusters in common with Andreon et al. 
(2004a), and it has been chosen as a compromise between 
sampling the whole cluster and not including a too large 
contribution from background galaxies. 

4.2. Evolutionary models 

In order to estimate the expected apparent magnitude, 
absolute magnitude and mass to light ratio for different 
galaxy models having various growth histories we used 
GRASIL (Silva et al. 1998; Panuzzo et al. 2005), which 
is a code to compute the spectral evolution of stellar sys- 
tems taking into account the effects of dust, which absorbs 
and scatters optical and UV photons and emits in the IR- 
submm region. We adopt standard elliptical (E), Sa, Sb, Sc 
and Arp 220 (SB) models with default parameters (Silva 
et al. 1998): a Salpeter initial mass function is used, with 
lower/upper limit fixed to 0.15/120 M@. We assume that 
no stars are formed from z — oo to z = 5, 2, 2, 1.5, 1.5 
for E, Sa, Sb, Sc and SB models, respectively. The mod- 
els fully account for evolving metallicity and dust content 
with dust mixed to stars (see Silva et al. 1998 for details). 
Figure 3 shows model mass growth histories appropriate 
for an object having currently broad band spectrophotom- 
etry typical of E, Sa, Sb, Sc and star burst (SB) galaxies. 
The E model of stellar mass growth history is character- 
ized by the absence of recent stellar mass growth, while 
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Fig. 4. Composite LF in the [3.6] band as a function of apparent (lower abscissa) and absolute (upper abscissa) 
magnitudes. Data points and error bars are computed as usual (e.g. Oemler 1973, see sec 4). The solid and long 
dashed curves are fit to unbinned counts, but neglecting astronomical and statistical subtleties, as described in sec 5.1. 
The solid curve refers to a fit with to* free, whereas the dotted curve is a fit with m* held fixed to the value observed 
at 0.25 < z < 0.40. All these LF determinations are superseded by the LF determined in sec 5.2, which is a rigorous 
fit over all cluster and field data. 



later types display recent episodes of stellar mass growth. 
These stellar mass growth histories are not intended to 
represent the stellar mass growth history of an individual 
object, but only of the average class, which is why these 
curves are smooth, while the mass growth of individual 
object is more erratic. 

5. Results 

5.1. A simplistic approach 

We start with a simple analysis of the data, without sta- 
tistical and astronomical subtleties. We largely follow the 
usual astronomical method of LF computation in the field 
and also in the cluster when many clusters are available. 
First, we bin data in redshift bins. In this step we ignore 
that the redshift bin is of non-vanishing width (i.e. we 
overlook the required convolution of the model by the ap- 
propriate redshift kernel) and that sources likely brighten 
in their rest-frame due to the younger age of stars go- 
ing from the near to the far side of the redshift bin. We 



also overlook some statistical subtleties (each cluster has 
a Schechter function, not just the composite), and logical 
coherence (we are attempting to measure the evolution 
assuming its absence, as most previous works did, and as 
criticized by Andreon 2004). A fully rigorous analysis is 
presented in section 5.2. 

There are 260, 250, 100 and 240 cluster (i.e. back- 
ground subtracted) members inside the composite clusters 
at 0.25 < z < 0.40, 0.40 < z < 0.65, 0.75 < z < 0.90 and 
0.99 < z < 1.25, respectively. 

As a model for the cluster LF we adopt a Schechter 
(1976) function: 

&(m) = ( /,*io a4 ( Q + 1 )( m *- m ) e - lo0 - 4(m " m) (1) 

where to*, a and <p* are the characteristic magnitude, 
slope and normalization, respectively. The index £ refers 
to the cluster £ . In this section, we fixed the Schechter 
slope a to —1 because it is largely undetermined. 

Figure 4 shows the LF in each redshift bin. Data points 
and error bars are computed as usual (e.g. Oemler 1973). 
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Fig. 5. The different lines shows the apparent magnitude 
of a galaxy having [3.6] = 16.3 mag at z — 0.5, depending 
on stellar mass growth histories, as labelled, as well as 
for an unphysical model where galaxies have the same old 
age at all redshifts (dotted black line). Data points are 
m* values derived in sec 5.1. The red region marks the 
observed 68 per cent confidence bound of m* derived in 
sec 5.2. 

When the difference between the cluster and control field 
counts is negative the result cannot be plotted, because 
the logarithm function requires a positive argument. The 
solid curve is the LF of the composite dataset, obtained by 
fitting the unbinned counts in the composite cluster and 
control field directions. We do not fit the displayed data 
points shown in the figure, and the LF fit (parameter or 
error determination) makes no use of these data points 
and errors, and they are shown for display purposes only: 
we fit, as mentioned, unbinned counts in the cluster and 
control field directions. 

The 'data' points nicely follow the Schecther function. 
As redshift increases, the LF moves to the right, i.e. to* 
becomes fainter in the observer frame, by about 1.0 mag 
between 0.25 < z < 0.40 and 0.99 < z < 1.25. In all 
redshift panels also plotted, as a long-dashed curve, is a 
fit with to* held fixed to the value observed in the lowest 
redshift bin. High-redshift data reject a model with m* 
held fixed to the value observed in the lowest redshift bin. 

Figure 5 shows the expected apparent magnitude of a 
galaxy having [3.6] = 16.3 mag at z = 0.5, for some stellar 
mass growth histories. It shows that a minimum of 1.5 mag 
of fading is expected in apparent magnitudes going from 
z = 0.3 to z = 1.1, whereas only 1.0 mag of fading is ob- 
served. However, it is not obvious from this figure whether 
the data reject the various stellar mass growth histories 
and at what confidence. We will not pursue the statisti- 
cal computation using the approximate method just de- 
scribed, but using the rigorous method detailed in the next 
section. 



5.2. Adding rigour 

Deriving luminosity evolution after having assumed that 
it is equal to zero in each studied redshift bin (i.e. the 
approach of the previous section) makes use of a circular 
argument: the computation of the LF assumes a model 
for galaxy evolution, that unfortunately is precisely what 
the LF is used to measure. This occurs even for the tra- 
ditional manner in which the LF is computed: having ob- 
servations at different redshifts (look-back times) and de- 
siring to measure how galaxies evolve, we should count 
galaxies having a given absolute mag at, say, z — 1 with 
galaxies having the same absolute mag at, say, z = (and 
in such a case we assume that galaxy luminosity does not 
change with look-back time), or with galaxies having a 
different luminosity (and in such a case we assume a given 
evolution). And, if we know how the luminosity evolves 
(because this is needed to make the computation), there 
is no need to perform the experiment (why measure the 
LF?). 

Beside the logical inconsistency, the assumption of a 
level of evolution underestimates error bars by a signif- 
icant factor, as measured in an actual case by Andreon 
(2004). We stress that a rigorous method is required if 
one wants to be sure of the correctness of the result, and, 
when the number of members in a given mag bin, esti- 
mated from the difference of galaxy counts in the cluster 
and control field directions, leads to unphysical (negative) 
values. One should, instead, solve for background counts, 
for the LF and its evolution at the same time, without bin- 
ning the sample in redshift bins and without bin the data 
in magnitude bins. This can be achieved adopting a more 
complex model in which additional parameters account for 
the evolution, following the path in previous works (Lin 
et al. 1999, Blanton et al. 2003, Andreon 2004) and rein- 
forced in APG. 

Algeabrically, to* in Eqn. 1 is replaced by: 

to* = to* =0 5 + Am mode i -Q (z - 0.5) (2) 

where Am mode i = m mo dei - m mod ei, z =o.5- 
Thus, we make the to* model fainter by an amount 
given by the prediction appropriate for the adopted stel- 
lar mass growth history and cosmology and we allow a 
supplementary linear evolution (i.e. proportional to red- 
shift), normalizing everything to z = 0.5 to make correc- 
tions equal to zero at roughly the median redshift of our 
survey. 

Our Q has a different meaning from Q in Lin et al. 
(1999), Blanton et al. (2003) and Andreon (2004): here 
Q = means that galaxies form stars according to the 
model, whereas in the former works Q = means that the 
luminosity stays constant over time, which is, of course, 
unphysical. 

The sample consists of about 5500 galaxies, of which 
about 950 galaxies are in clusters, and the remaining are 
interlopers. 

To compute the LF, we start with model selection. We 
use two statistical tests: the likelihood ratio test (LRT) 



S. Andreon: The buildup of stellar mass function 



7 



lg(Mass) [M sol ] @ z = 0.5 
12 11.5 11 10.5 10 

i i I I I I i I I I I i i I I I I i I I I I I I i I I =r~i r 




-26 -24 -22 14 16 18 



M z=Q 5 m red. to z = 0.5 

Fig. 6. Best fit LF at z = 0.5. Left: LF of the whole sample. Right: LF of the lower and upper redshift halves (blue 
closed triangles and red open circles, respectively). The match of the two halves occurs only if the assumed evolution 
is the correct one. 



will inform us of how frequently one incorrectly rejects the 
null hypothesis, under the hypothesis that the null is true. 
The Bayesian Information Criteria (BIC, Swartz 1978; 
Lindlc 2004 provides a useful astronomical introduction) 
informs us about the relative evidence of two models. LRT 
cannot be used when regularity conditions required for its 
application do not hold (for example, compared models 
should be hierarchically nested, i.e. one model should be 
a particular case of a more general model). 

We compare the E model to the other models, all with 
fixed Q = 0, irrespective of the m* and a values. BIC pro- 
vides very strong support for the E stellar mass growth 
history (ABIC i; 5 for Sa, and larger values for the other 
models). Our statistical analysis offers the advantage of 
avoiding an unnecessary assumption about the value of the 
best fit parameters in order to identify the most likely evo- 
lutionary model: it is able to infer an evolutionary model 
without any assumption about the a value, while such an 
assumption was done in sec 5.1 and in previous works, 
because these studies were forced to fix the a parameter 
to derive the evolution of m* . Not fixing the a value, our 
approach is not affected by the known correlation between 
m* and a that plagued previous approaches. 

Figure 3 summarizes the above model comparison: the 
acceptable area (i.e. the constraint put on models by our 
data) is a part of the shaded (green) area plotted in Figure 
3, ie. the region between the E track (a good descrip- 
tion) and the Sa track (a bad one). The major differ- 
ence between the Sa track (rejected by the data) and E 
track begins at look-back times greater than 7 Gyr, i.e. 
at z > 0.85, and becomes large at look-back times greater 
than 7.5 Gyr (i.e. z <; 1). We can discriminate between 
the two models because we have 8 clusters at z > 0.85 



and 5 clusters at z > 0.99, where models differ the most 
(Fig 3). 

We now verify whether the model is a) too complex 
(too many unconstrain parameters) for the data in hand, 
and b) if the data requires that the E model should be 
updated with a better one. We compare the E model hav- 
ing fixed Q = and a = — 1 to more general E models 
with Q or a free. BIC and LRT both inform us that the 
simplest model is favored. Given the data in hand, the 
E model does not need to be refined by the addition of 
a linear (with redshift, i.e. a Q) term. Furthermore, the 
a parameter is largely unconstrained, quantifying what is 
qualitatively apparent in Fig. 4. 

BIC strongly rejects the unphysical universe (the track 
of an E with the present-day age at all redshifts). 

The final best fit model is therefore the one of an old 
and passively evolving population formed at high redshift, 
without any additional recent stellar mass growth (i.e. the 
E model and Q = 0). For a = — 1 (the best formal fit is 
a = —1.05, but with large error bars), we found w* =0 5 = 
16.30 ±0.10 mag, i.e. M* = -24.8,-24.9,-25.1,-25.3 
mag at 2 = 0.3,0.5,0.8,1.1. 

Figure 6 shows the rigorously derived best fit LF (the 
curve) at z = 0.5 and the data points. Having measured 
the luminosity evolution, we can now safely assume it in 
order to combine counts at different redshifts (look-back 
times). We normalize each individual LF by the model <p* 
value, and we then weight each cluster by its <j)* value, 
after having evolved magnitudes from the cluster redshift 
to our reference redshift, z — 0.5. In doing this convo- 
lution we keep only bins entirely included in the stud- 
ied magnitude range, for simplicity. As in literature ap- 
proaches, errors on the data points do not include data 
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combining errors (whereas they are accounted for in our 
rigorous derivation, see APG). There is a good agreement 
between the curve and the data points, meaning that the 
Schechter function is a good description of the LF over 
the observed magnitude range, and that the selected stel- 
lar mass growth history provides a good description of the 
observed evolution. The latter point is displayed in the 
right panel of Figure 6. It shows the LF by splitting the 
sample in two redshift halves, at median redshift. The two 
LFs share a common m*_ 5 , and this occurs only if we 
selected the right stellar mass growth history model: if the 
adopted model underestimates the luminosity evolution in 
one of the halves, two different (horizontally shifted) LFs 
would be observed (as we discuss later). This "result" is 
a visual check of the model selection already discussed. 

Figure 7 qualitatively shows why we have ruled out 
the evolution of a stellar population whose stellar mass 
growth history is appropriate for an Sa (or later types): 
it displays the LF of clusters at z < 0.85 and z > 0.85, 
separately, evolved to z = 0.5 assuming the stellar mass 
growth history of an Sa population. The ro* =0 5 of the 
two samples do not match, as emphasized by the bottom 
panel that shows the cumulative LF: the high-redshift LF 
is left-shifted (too bright) with respect to the low-redshift 
LF, contrary to the adopted hypothesis that evolution is 
well described by the adopted stellar mass growth history. 
The mass evolution allowed by the data is much quieter 
than that of an Sa galaxy. The model selection performed 
confirms this result rigorously, quantifying its statistical 
significance. 

Having identified the evolutionary model, we can con- 
vert absolute magnitudes into a stellar mass scale, us- 
ing the A4/L ratio of the model. The mass scale is 
shown as the upper abscissa in Fig 6. We found A4* = 
1.1,1.2,1.2,1.2 10 n M Q at z = 0.3,0.5,0.8,1.1, respec- 
tively, were M* = M*(M/ L) mo d e i (in the appropriate 
units). The statistical accuracy is 10 per cent, derived from 
the m* uncertainty only. The absolute value of the charac- 
teristic mass, A4*, depends on several key model parame- 
ters (e.g. the lower mass limit of the initial mass function, 
see Bell & de Jong 2001), while its evolution does not, as 
long as model parameters are redshift-independent. 

Thus, we found that the luminosity of our galaxies 
evolves as an old and passively evolving population formed 
at high redshift. Models with a prolonged stellar mass 
growth are rejected by the data with high confidence. The 
mass function does not change in the last 8 Gyr, corre- 
sponing to the two thirds of the current Universe age. 
The data also reject the need for a redshift-dependent de- 
scription of the evolution more accurate than a passively 
evolving population formed at high redshift (i.e. a Q =/= 
is rejected). The data also reject models in which the age 
of the stars is the same at all redshifts. The Schechter 
function well describes the data. 

It makes little sense to improve upon these constraints 
with our data alone, say to attempt to constrain the char- 
acteristic time scale of the mass growth because of a de- 
generacy: the same luminosity evolution may be produced 
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Fig. 7. Top panel: LFs, evolved to z = 0.5 assuming an 
Sa stellar mass growth history, of the lower and upper 
redshifts ranges (marked by blue closed triangles and red 
open circles for z < 0.85 and z > 0.85, respectively). The 
mismatch between the two LFs (circles are left-shifted) 
implies that the assumed stellar mass growth history is 
rejected by the data. Bottom panel: blow-up of the cumu- 
lated LF, better showing the LF mismatch. 



by an older, but longer, episode of star formation, or a 
younger but shorter one. 



5.3. Test on incompleteness 

In order to test the effect of a potential incompleteness 
of the sample at [3.6] ~ 18 mag, we cut the sample at 
12.5 < [3.6] < 17.5 mag, and we compute m* for the E 
model and Q = 0. We found m* — 16.54 ±0.11 mag, 
in good agreement with the value measured by consider- 
ing the larger sample [3.6] < 18 mag, 16.30 ± 0.10 mag. 
Application of BIC still favour the E model over Sa and 
later type ones, but now at lower significance (ABIC 3) 
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because of the reduced sample size. Tus, the potential in- 
completeness of the sample at [3.6] ~ 18 mag is too small 
to affect the results of our analysis. 

6. Discussion 

6.1. Comparison with previous works 

There are no LFs in the [3.6] band with which we can 
directly compare, and therefore, we compare our [3.6] LF 
with LFs derived at shorter wavelengths. Our measure of 
evolution at 3.6 /im is a luminosity-weighted measure of 
evolution. The advantage of the chosen band is that it 
is far less sensitive to sporadic star formation episodes 
involving a small fraction of the mass than similar deter- 
minations performed at shorter wavelengths, because the 
[3.6] band measures the flux emitted between the H (at 
z = 1.25) and K (at z ~ 0.1) band rest-frame. At these 
wavelengths the flux-weighted age of a simple stellar pop- 
ulation from Bruzual and Chariot (2003) is about 5 Gyr 
(e.g. Martin et al. 2005). Instead, in the B band rest-frame, 
the flux- weighted age is 1.5 Gyr, and drops by a factor of 
10 at A ~ 3000 A . Therefore, comparison of results ob- 
tained in different bands requires that we pay attention to 
the considered wavelengths. 

Luminosity evolution derived in the [3.6] and K bands 
can be directly compared, because both determinations 
are sensitive to the same long-lived stars. Toft et al. 
(2004) summarized the cumulative efforts in the literature 
(largely relying on de Propris et al. 1999) to determine the 
LF at z ij 0.1. Our data alone match in number and in 
rcdshift distribution this cumulate effort. Toft et al. (2004) 
and de Propris et al. (1999) both find that the redshift de- 
pendence of to* agrees with that of a passively evolving 
population formed at high rcdshift, as we have found and 
as also found by Kodama & Bower (2003) and Kodama 
et al. (2004). To quantitatively compare values derived in 
different bands, we need to determine m*_ 5 for the same 
slope adopted in the comparison work (a = —0.9, because 
most of the measurements have been performed with such 
a slope), and to convert our to* =0 5 from [3.6] to the K 
band. To perform the latter task we use model K — [3.6] 
colors provided by Grasil. The best fit value converted to 
the K band is m* =0 5 = 16.7 ±0.1, in good agreement 
with the value inferred from Fig. 11 in Toft et al. (2004) 
m* =0 5 = 16.6 ± 0.5 mag. Our measure has a five times 
better accuracy than the latter, because Toft et al. (2004) 
do not fit to* to their data, but simply force to* to be equal 
to the value observed by de Propris et al. (1998) at z = 0, 
thus inheriting its accuracy (±0.5 mag). As mentioned in 
the introduction, our analysis rules out several alternatives 
(a non-passive evolution, for example), whereas Propris et 
al. (1999) do not address the topic of model selection, per- 
haps because of the freedom in cosmological parameters 
at the time of their analysis. 

The LF study by Andreon et al. (2004) samples « U 
and B bands and their sample of clusters has a large over- 
lap (13 clusters) with the one studied in this paper. They 
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found that clusters arc composed of two populations, one 
that had evolved passively from Zf > 2, and one formed 
at lower rcdshift (zf < 1). The bands used trace, as men- 
tioned, almost instantaneous star formation more than the 
stellar mass growth studied in this paper. The lack of a 
detection of a secondary stellar mass growth episode at 
[3.6] micron combined with its detection at shorter wave- 
lengths implies that the mass involved in such episodes is 
small. Quantification of the mass involved will be reported 
elsewhere. 



For the measurement of the stellar mass evolution, 
most of the literature works compare mass estimates de- 
rived from different estimators at different rcdshifts, be- 
cause of the lack of similar measures (in the galaxy rest- 
frame) over a large redshift range. Instead, our measures 
are homogeneous and derived from a single estimator over 
the whole redshift range. Our mass evolution estimate 
may rely on the appropriateness of the adopted model 
(i.e. stellar mass growth history). However, we allow devi- 
ations from the model (several stellar mass growth histo- 
ries and non-null Q values), but the data rejected them. 
Furthermore, literature approaches also rely on stellar 
mass growth models, and to a larger degree than us, be- 
cause they assume that the stellar mass growth history 
of each individual galaxy is well described by a simple 
model, whereas we assume that the above holds true for 
the average galaxy. Averages, by definition, are smoother 
and better described by a smooth model than individual 
measures. 



Our results on evolution of the stellar mass in clus- 
ter galaxies are in broad agreement with the literature. 
Bundy, Ellis and Conselice (2005) find little evolution in 
the field from z ~ 1 to z ~ 0, with particular emphasis on 
bright (massive) objects better sampled in their (and in 
our) work. Our result also agrees with Kodama & Bower 
(2003) and Kodama et al. (2004), who claim that there 
has been little evolution in A4*, but with large uncertain- 
ties, by comparing its value measured in cluster candi- 
dates and the z ~ value (from Balogh et al. 2001), using 
heterogeneous mass estimators and data. Our sample of 
clusters at high redshift is larger than theirs (we have 6 
spectroscopically confirmed clusters at z > 0.99, vs 3 can- 
didate clusters in Kodama & Bower 2003 and 5 candidate 
clusters in Kodama et al. 2004), and our comparisons use 
the same mass estimator and uniform data. Furthermore, 
spectroscopic observations presented in a very recent pa- 
per (Yamada et al. 2005) suggest that at least two of the 
Kodama et al. (2003) clusters are instead line of sight 
superpositions. The cluster nature of all our clusters has 
been spectroscopically confirmed, and, for all but three 
clusters, also confirmed through the detection of the clus- 
ter x-ray emission. 
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6.2. Early assembly and formation time of cluster 
galaxies 

The evolution of the mass function of cluster galaxies only 
measures the evolution of the galaxy population as a whole 
and does not necessarily imply a direct correspondence 
to the evolution of individual galaxies. For example, the 
constancy of the mass function can be interpreted equally 
well as a combination of different and more complicated 
evolution of individual galaxies, some of which grow stellar 
mass (say, by mergers), and some that lose stellar mass. 
However, such a possibility is unlikely, because it requires 
two physical mechanisms with similar mass and time (i.e. 
redshift) dependencies, otherwise the stellar mass function 
would change. The simplest interpretation, supported by 
the existence of very massive {M > 10 n " 5 M©) galaxies in 
our clusters, is that the mass assembly of most of galaxies 
in clusters (sampled with the available data) was largely 
complete at z > 1.25. 

Fundamental plane and colour studies (e.g. Bower 
et al. 1992; Stanford et al. 1998; Kodama et al. 1998; 
Andreon 2003; Andreon et al. 2004a; van Dokkum & 
Stanford 2003 and references therein) suggest that there 
is little recent star formation in early-type or red galax- 
ies but does not tell us whether these galaxies have been 
completely assembled. As long as early- type or red galax- 
ies are not a minority population in our clusters, the ob- 
served constancy of the mass function from z = 1.25 to 
the present-day, as well as the old age of the stellar popu- 
lations, implies that these galaxies have been completely 
assembled, and not only that their stars are old. 

7. Summary 

We had measured the 3.6 /im luminosity evolution of 
about 1000 galaxies in 32 clusters at 0.2 < z < 1.25, 
without any a priori assumption about luminosity evolu- 
tion, i.e. in a logically rigorous way. Our data match in 
number and in redshift distribution the cumulated litera- 
ture effort thus far. The quality of the data allows us to 
derive the LF and mass evolution homogeneously over the 
whole redshift range, using a single estimator, at variance 
with previous determinations. We found that the luminos- 
ity of our galaxies evolves as an old and passively evolv- 
ing population formed at high redshift without any need 
for a further redshift-dependent evolution. Models with 
a prolonged stellar mass growth are rejected by the data 
with high confidence. Data also reject models in which the 
age of the stars is the same at all redshifts. Similarly, the 
characteristic mass function evolves as a passively evolving 
stellar population formed at high redshift. The Schechter 
function describes the galaxy luminosity function well. 
The characteristic luminosity at z = 0.5 is 16.30 mag with 
a 10 per cent uncertainty. 

Acknowledgements. The author would like to warmly thank 
Giovanni Punzi for his statistical advice, and the anonymous 
referee for his careful reading of the paper. This work used 



Minuit, by F. James. C. Lonsdale and S. Oliver are acknowl- 
edged for useful discussions. 

References 



Abell, G. O. 1958, ApJS, 3, 211 
Andreon, S. 2001, ApJ, 547, 623 
Andreon, S. 2003, A&A, 409, 37 
Andreon, S. 2004, A&A, 416, 865 

Andreon, S. 2005, in "The fabulous destiny of galaxies: 
Bridging past and present", ( astro-ph/0509627 ) 

Andreon, S., & Cuillandre, J.-C. 2002, ApJ, 569, 144 

Andreon, S., Lobo, C, & Iovino, A. 2004, MNRAS, 349, 889 

Andreon, S., Willis, J., Quintana, H., Valtchanov, I., Pierre, 
M., & Pacaud, F. 2004a, MNRAS, 353, 353 

Andreon, S., Willis, J., Quintana, H., Valtchanov, I., Pierre, 
M., & Pacaud, F. 2004b, in the proceeding of 'Rencontres 
de Moriond: Exploring the Universe. Contents and 
Structure of the Universe' (astro-ph/0405574 ) 

Andreon, S. et al. 2005a, MNRAS, 359, 1250 

Andreon, S. et al. 2005b, MNRAS, submitted 

Andreon, S., Punzi, G., Grado, A., 2005, MNRAS, 360, 727 

Avni Y., 1976, ApJ 210, 642 

Bahcall, N. A. 1977, ApJ, 217, L77 

Balogh, M. L., Pearce, F. R., Bower, R. G., & Kay, S. T. 2001, 

MNRAS, 326, 1228 
Bell, E. F., & de Jong, R. S. 2001, ApJ, 550, 212 
Bertin, E., & Arnouts, S. 1996, A&AS, 117, 393 
Bishop CM., 1995, Neural Networks for Pattern Recognition, 

Oxford UK: Oxford University Press 
Blanton, M. R. et al. 2003, ApJ, 592, 819 
Bower, R. G., Lucey, J. R., & Ellis, R. S. 1992, MNRAS, 254, 

601 

Bruzual, G., & Chariot, S. 2003, MNRAS, 344, 1000 

Bundy, K., Fukugita, M., Ellis, R. S., Kodama, T., & Conselice, 

C. J. 2004, ApJ, 601, L123 
Butcher, H., & Oemler, A. 1984, ApJ, 285, 426 
de Propris, R., Eisenhardt, P. R., Stanford, S. A., & Dickinson, 

M. 1998, ApJ, 503, L45 
de Propris, R., Stanford, S. A., Eisenhardt, P. R., Dickinson, 

M., & Elston, R. 1999, AJ, 118, 719 
Drory, N., Bender, R., Feulner, G., Hopp, U., Maraston, C, 

Snigula, J., & Hill, G. J. 2004, ApJ, 608, 742 
Ebeling, H., Edge, A. C, Fabian, A. C, Allen, S. W., Crawford, 

C. S., & Boehringer, H. 1997, ApJ, 479, L101 
Eisenhardt, P. R., et al. 2004, ApJS, 154, 48 
Fazio, G. G., et al. 2004, ApJS, 154, 10 
Fontana, A., et al. 2004, A&A, 424, 23 
Jeffreys, H. 1938, MNRAS, 98, 190 

Kodama, T., Arimoto, N., Barger, A. J., & Arag'on- 

Salamanca, A. 1998, A&A, 334, 99 
Kodama, T., & Bower, R. 2003, MNRAS, 346, 1 
Kodama, T., et al. 2004, MNRAS, 350, 1005 
Liddle, A. R. 2004, MNRAS, 351, L49 

Lin, H., Yee, H. K. C, Carlberg, R. G., Morris, S. L., Sawicki, 
M., Patton, D. R., Wirth, G., & Shepherd, C. W. 1999, 
ApJ, 518, 533 

Lonsdale, C. J., et al. 2003, PASP, 115, 897 

Lonsdale, C, et al. 2004, ApJS, 154, 54 

Martin, D. C, et al. 2005, ApJ, 619, LI 

Oemler, A. , Jr. 1974, ApJ, 194, 1 



S. Andreon: The buildup of stellar mass function 



11 



Pahre, M. A., Ashby, M. L. N., Fazio, G. G., & Willner, S. P. 

2004, ApJS, 154, 235 
Panuzzo, P., Silva, L., Granato, G. L., Bressan, A., & 

Vega, O. 2005, in "The Spectral Energy Distribution of 

Gas- Rich Galaxies: Confronting Models with Data", eds. 

C.C. Popescu and R.J. Tuffs, AIP Conf. Ser., in press, 

jastro-ph/0501464| | 
Pierre, M., Valtchanov, I., Altieri, B., et al., 2004, JCAP, 09, 

011 

Press, W. H., Flannery, B. P., & Teukolsky, S. A. 1986, 

Numerical Recipes, Cambridge: University Press, 1986 
Rakos, K. D., & Schombert, J. M. 1995, ApJ, 439, 47 
Sandage, A., Tammann, G. A., & Yahil, A. 1979, ApJ, 232, 
352 

Schechter, P. 1976, ApJ, 203, 297 
Schwarz G., 1978, Annals of Statistics, 5, 461 
Silva, L., Granato, G. L., Bressan, A., & Danese, L. 1998, ApJ, 
509, 103 

Stanford, S. A., Eisenhardt, P. R., & Dickinson, M. 1998, ApJ, 
492, 461 

Toft, S., Mainieri, V., Rosati, P., Lidman, C, Demarco, R., 

Nonino, M., & Stanford, S. A. 2004, A&A, 422, 29 
van Dokkum, P. G., & Franx, M. 2001, ApJ, 553, 90 
van Dokkum, P. G., & Stanford, S. A. 2003, ApJ, 585, 78 

Valtchanov, L, et al. 2004, A&A, 423, 75 

Yamada, T., et al. 2005, ApJ, in press (astro-ph/0508594) 
Wilks, S., 1938, Ann. Math. Stat. 9, 60 

Wilks, S., 1963, Mathematical Statistics (Princeton: Princeton 

University Press). 
Willis, J. et al., 2005, MNRAS, 363, 675 
Werner, M. W., et al. 2004, ApJS, 154, 1 
Zwicky, F. 1957, Morphological astronomy, Berlin: Springer 



